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1 .  INTRODUCTION 


i 

I 


Anderson  and  Duffin  [2]  have  introduced  the  concept  of  "parallel 
sum"  of  a  pair  of  matrices  and  have  deduced  interesting  and  impor¬ 
tant  properties  of  this  operation  when  the  matrices  concerned  are 
non-negative  definite.  They  were  led  to  this  concept  from  a  paral¬ 
lel  connection  of  "resistors"  through  a  vectorial  generalization  of 
Kirchoff's  and  Ohm's  laws  in  which  resistors  become  non-negative 
definite  linear  operators.  The  concept  of  parallel  sum  was  extended 
and  its  elegance  further  demonstrated  by  Rao  and  Mitra  [19]  who 
showed  that  most  of  the  properties  proved  by  Anderson  and  Duffin  [2] 
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are  indeed  true  for  a  much  wider  class  of  pairs  of  matrices, 

designated  by  these  authors  as  "parallel  summable" .  Similar 

extensions  of  the  concept  of  "hybrid  sum" ,  introduced  by  Duff in 

and  Trapp  [8]  in  analogy  with  a  hybrid  connection  of  resistors, 

were  made  by  Mitra  and  Trapp  [17] .  The  object  of  this  paper  is 

to  offer  a  comparable  extension  of  the  notion  of  a  shorted  oper- 

(3) 

ator  studied  by  Anderson  [1]  ,  by  Anderson  and  Trapp  [3]  and 

the  present  authors  [14] .  The  key  point  in  this  development  is 
a  theorem  of  Anderson  and  Trapp  which  exhibits  the  shorted  n.n.d. 
matrix  as  the  limit  of  a  sequence  of  parallel  sum  matrices.  We 
also  describe  some  applications  of  the  shorted  operator  in  mathe¬ 
matical  statistics. 


2 .  PRELIMINARIES 

Before  we  proceed  any  further  let  us  record  here  some  known 
results  on  the  parallel  sum.  We  follow  the  same  notation  as  in 
our  earlier  paper  [14] . 


Definition  2.1.  Matrices  A  and  B  in  C111*11  are  said  to  be 
parallel  summable  [p.s.[l9])  if  A(A  +  B)”B  is  invariant  under 
the  choice  of  the  generalized  inverse  (A  +  B)~  .  If  A  and  B 
are  p.s.,  P(A,B)  =  A (A  +  B)  ~B  is  called  the  parallel  sum  of 
A  and  B  . 

The  following  theorem  is  proved  in  [19], 


(3) 


See  also  Krein  CIO]. 
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Theorem  2.1.  A  and  B  are  p.s.  iff 

M  (A)  c  M  (A  +  B)  ,  M  (A*)  c  M (A*  +  B*) 
or  equivalently 

M(B)  c  M (A  +  B)  ,  M (B*)  c  M (A*  +  B*) 

Theorem  2.2  lists  certain  known  properties  of  the  parallel  sum 
[2,13,19]  . 

Theorem  2.2.  If  A  and  B  are  p.s.  matrices  in  cmxn  ,  then 


(a) 

P  (A, B)  =  P  (B,  A)  , 

(b) 

A*  and  B*  are  also  p, 

>  s .  and 

P  (A*  ,B*)  = 

[P  (A, B)  ]  *  , 

(c) 

P (A, B)  is  n.n.d.  when 

m  =  n 

and  A,B 

are  n.n.d., 

<d) 

for  C  of  rank  m  in 

cpxm 

CA  and  CB 

are  p.s. 

and  P(CA,CB)  =  CP(A,B) 

(e) 

{ [P (A,B) ] ~}  =  {A"  +  B"} 

(f) 

M  [P  (A,  B)  ]  =  M(A)  n  M(B) 

• 

(g) 

if  P*  is  the  orthogonal  projector  onto 

M (A*)  n  M(B*) 

and  P  is  the  orthogonal  projector  onto  M(A)  n  M (B)  , 
then  [P(A,B)]+  »  P* (A~  +  B_)P  . 

(h)  P[P(A,B),C]  =  P [A,P (B,C) ]  if  all  the  parallel  sum 
operations  are  defined. 

(i)  if  P  and  P  are  the  orthogonal  projectors  onto 

A  D 

H(A)  and  M(B),  respectively,  then  the  orthogonal  pro¬ 
jector  onto  M (A)  n  M(B)  is  given  by  P  =  2P(PA#PB) 

Definition  2.2.  If  S  is  a  subspace  of  Em  and  A  e  Cm  ,  the 
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shorted  matrix  S (A)  is  the  unique  matrix  in  C  such  that 

m 


M  [3  (A)  1  c  S 
A  *  S(A) 

if  C  e  C  .  A  ^  C  and  M (C)  =  S  then  S( A)  *  C  . 
m 

The  existence  of  S( A)  was  established  by  Anderson  and  Trapp [3]  . 


Theorem  2.3.  Let  A,B  e  Cm  and  M (B)  =  S  ,  then 

S (A)  =  lim  i  P(XA,B) 

X-*0  A 

Theorem  2.3  was  proved  by  Anderson  and  Trapp  for  the  special  case 
where  B  is  the  orthogonal  projector  (Theorem  12  [3]).  The  gen¬ 
eral  case  could  be  proved  on  same  lines.  Alternatively,  a  direct 
proof  could  be  constructed  using  simultaneous  diagonalization  of 
the  pair  A,B  of  n.n.d.  matrices  (see  e.g.  Theorem  6.2.3,119]). 


3.  The  Shorted  Matrix  -  An  Extended  Concept 

Let  the  pair  of  matrices  A,B  in  Cmxn  be  p.s.  and 

lim  A ( XA  +  B)~B  =  C  (3.1) 

X+0 


exist  and  be  finite. 

Theorem  3.1  gives  certain  properties  of  the  C  matrix  when 
it  exists. 
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Theorem  3. 

1. 

(a) 

C  =  lim  B(XA  +  B)“A 

X-*0 

(3.2) 

(b) 

M  (C)  c  M(A)  n  M(B) 

M(C*)  c  M (A*)  n  M(B*) 

(c) 

M(a  -  C)  is  virtually  disjoint  with 

M(A*  -  C*)  with  M (B*) 

M(B)  and  so  is 

(d) 

C  and  A  -  C  are  disjoint  matrices 

M (A)  *  M(C)  •  M(A-C) 

[12] ,  that  is 

M(A*)  =  M (C*)  •  M (A*  -  C*) 

(3.3) 

(e) 

M(C)  =  M(A)  n  M(B) 

M (C* )  =  M(A*)  n  M(B*) 

(3.4) 

(f) 

Let  E  be  any  matrix  such  that  M (E) 

M (E*)  c  M(B*)  ,  then 

Rank  (A  -  E)  2  Rank  (A  -  C) 

c  M  (B)  , 

the  sign  of  equality  holding  if  and  only  if  E  *  C  . 

Proof:  (a) 

.  If  A  and  B  are  p . s . ,  X A  and 

B  are  p.s.  for 

each  X  sufficiently  small.  Hence 

C  *  lira  Y  P ( XA,B) 

X-0  A 

Since  P ( AA,B)  =  P(B,XA)  by  Theorem  2.2(a),  (3.2)  follows, 
(b) .  Consider  a  typical  vector  Cx  in  M(C) 

Cx  *  lim  y. 

X-*-0  ~ 

=  a ( XA  +  B)~Bx  e  M(A)  .  Since  M(A)  is  closed, 

Cx  »  lim  y.  e  U  (A)  . 

X-»0  ~A 


Where  y^ 
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Similarly  using  (3.2)  we  have  Cx  e  M(B)  and  hence 

Cx  e  M (A)  n  M (B)  .  The  other  part  of  (b)  is  established  in  a 

like  manner. 

(c)  .  Let  (A  -  C)u  =  By  be  a  vector  in  M (A  -  C)  n  M(B)  .  Then 
Au  =  Cu  +  Bv  . 

Hence  Cu  =  lim  B(XA  +  B) ~Au  =  lim  B(XA  +  B)“(Cu  +  Bv) 

X-+0  ~  X^O 

lim  B(XA  +  B)  Bv  =  lim  XA(XA  +  B)-Cu  (3.5) 

X+0  A+0 

Since  C  =  BK  for  some  matrix  K  in  Cnxm  t  the  RHS  of  (3.5)  is 
seen  to  be  equal  to  the  null  vector ,  while 

B  =  B(XA  +  B)"(XA  +  B) 

for  each  X  sufficiently  small  implies  on  taking  limits  of  both 
sides  that 

B  =  lim  B ( XA  +  B)~B 
X->0 

Hence  Bv  =  0  and  the  first  part  of  (c)  is  established.  The 
proof  of  the  second  part  is  similar. 

(d)  .  (d)  is  a  simple  consequence  of  (c) . - 

(e)  .  Let  x  e  M(A)  n  M(B)  .  Using  (3.3)  we  write 

;  =  *i  +  *2 

where  x^  e  M(C)  and  X£  e  M(A  -  C)  .  Observe  that 
x2  -  x  -  Xj^  e  M(B)  and  hence  6  M(B)  n  M(A  -  C)  .  This  implies 
*2  *  0  and  x  =  x-j^  e  M(C)  .  This,  in  view  of  (b)  establishes 
the  first  part  of  (e) .  The  other  part  is  similarly  deduced. 

(f)  In  view  of  (c) ,  the  expression 
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A  -  E  =  (C  -  E)  +  (A  -  C) 

exhibits  A  -  E  as  the  sum  of  two  disjoint  matrices  C  -  E 
and  A  -  C  .  Hence 

Rank  (A  -  E)  =  Rank  (C  -  E)  +  Rank  (A  -  C) 

This  concludes  the  proof  of  (f)  and  of  Theorem  3.1. 

The  matrix  C  will  henceforth  be  called  the  matrix  A 
shorted  by  the  matrix  B  and  denoted  by  S(A|B)  .  Theorem 
3.2  gives  some  more  properties  of  the  shorted  matrix. 

Theorem  3.2.  Let  S(AjB)  exist.  Then 

(a)  S(A*|B*)  also  exists  and  S(a*|b*)  =  [S(A|B)]*  , 

(b)  if  K  e  CpXm  and  Rank  K  -  m,  S(KA|KB)  exists 
and  S(KA|KB)  -  KS(A|B) 

(c)  if  m  =  n  ,  A  is  n.n.d.  and  further 

M(B)  n  M(A)  =  M(B*)  n  M(A)  (3.6) 

S(A|B)  is  n.n.d. 

Proof:  Proofs  of  (a)  and  (b)  are  straightforward  and  are  omitted, 
(c) .  Observe  that  by  the  (a)  part  if  C  -  S(A|B),  then 
C*  *  S (A |B*) .  Further,  if  (3.6)  holds  both  C  and  C*  have 
identical  row  and  column  spans.  Application  of  theorem  3.1(f)  now 
shows  C  =  C*  . 

Choose  and  fix  a  hermitian  g-inverse  A~  of  A  .  If  for  some 
x,  x*  C  x  ^  0  , 

x*CA-AA”Cx  =  x*  C  x  <  0  . 

The  equality  follows  from  the  fact  that  A  AA  is  a  g-inverse  of 


•^W»PYf!OW»l 
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A  and  A  being  the  sum  of  disjoint  matrices  C  and  A  -  C 
every  g-inverse  of  A  is  a  g-inverse  of  C  .  This  contradicts 
the  assumption  that  A  is  n.n.d. ,  and  establishes  the  claim  in  (c) 

That  (c)  is  not  true  in  general  can  be  seen  from  the  follow¬ 
ing  counter  example.  Let  I  and  H  be  the  identity  matrix  and 
an  idempotent  matrix  in  CmXin  .  it  is  not  difficult  to  see  that 
S ( I | H)  exists  and  is  equal  to  H  .  H  need  not  be  hermitian, 
and  it  is  conceivable  that  a  vector  x  might  exist  such  that 
x*H  x  <  0  .  Consider  the  idempotent  matrix 


for  such  a  counter  example. 

Remarks.  A  matrix  A  e  Cm*m  is  said  to  be  almost  positive 
definite  (a.p.d. )  [7, 11]  if  V  x  e  Em  ,  Re  (x*  A  x)  a  0  and 
x*  A  x  =  0  Ax  =  0  .  Unlike  n.n.d.  matrices,  an  a.p.d. 
matrix  need  not  be  hermitian.  Similar  to  theorem  3.2(c),  we 
can  prove  that  if  A  is  a.p.d.,  S(A|B)  exists  and  (3.6) 
holds,  then  S(A]B)  is  a.p.d. 

This  can  be  proved  as  follows.  Since  A  is  a.p.d.,  it 
follows  as  in  Theorem  2  [11]  that  A  is  an  EP  matrix  (that 
is  M(a)  =  M(A*)).  Equations  (3.4)  and  (3.6)  therefore  imply 
that  C  =  S(AjB)  is  EP  .  Further  as  in  Corollaries  2  and  3 
of  [11],  A+  is  seen  to  be  a.p.d.  Since  C  is  EP 

C[I  -  A+C]  =  0  =*>  CM!  -  A+C]  =  0  =>  C*  =  C*A+C 
If  Re  (x*Cx)  <  0  ,  then  Re(x*C*x)  =  Re(x*C*A+Cx)  <  0  , 
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which  contradicts  almost  positive  definiteness  of  A+  .  Also, 

x*  C  x  =  0  =t>  A+Cx  =  0=^  Cx  =  0. 

A  matrix  A  e  Cm*m  is  said  to  be  positive  semidefinite  (p.s.d.) 
if  V  x  £  Em,  Re(x*  Ax)  >  0  .  When  A  is  p.s.d.  and  t3.6) 
holds,  S(A|B)  if  it  exists  is  also  p.s.d.  This  can  be  proved 
on  similar  lines. 

Let  the  matrices  A  and  B  be  p.s.  and  A  +  B  be  of  rank 
r  .  Consider  a  rank  factorization  of  A  +  B: 

A  +  B  =  LR 


and  the  representations 

A  =  LDR  ,  B  =  L ( 1  -  D)  R  (3.7) 

implied  by  parallel  summability,  where  D  and  I  -  D  are  square 
matrices  in  C  .It  is  easily  seen  that  in  any  such  repre¬ 
sentation,  the  matrix  D  (and  naturally  I  -  D)  is  uniquely  deter¬ 
mined  up  to  a  similarity  transformation.  Theorem  3.3  gives  a 
necessary  and  sufficient  condition  for  S(A|B)  to  exist. 

Theorem  3.3.  Let  A  and  B  be  p.s.,  S (A | B)  exists  iff  I  -  D 
is  of  Drazin  index  1,  that  is 

Rank  (I  -  D) 2  =  Rank  (I  -  D) 
or  equivalently  Rank  B (A  +  B)B  =  Rank  B 

Proof:  Without  any  loss  of  generality  we  assume  that  D  is 
already  in  Jordan  canonical  form,  and  write  D  as  the  sum  of 
two  disjoint  matrices  and  D2 ,  each  of  order  r  x  r  where 


(3.8) 

(3.9) 
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is  identical  with  D  everywhere  including  in  all  its 
diagonal  Jordan  blocks,  except  for  the  Jordan  block  corres¬ 
ponding  to  the  eigen  value  1  ,  if  any,  which  is  replaced  by 
a  null  matrix  and  D2  =  D  -  D1  .  The  above  assumption  (3.8) 
implies  that  (I  -  D)  is  disjoint  with  D2  •  Hence 

A ( AA  +  B)"B  =  LD ( AD  +  I  -  D)"(I  -  D) R 

=  L(D1  +  D2) [AD1  +  (I  -  D)  +  AD2)"(I  -  D) R 
=  LD1  (ADL  +  (I  -  D)  +  AD2]”(I  -  D) R 
=  LD1fAD1  ♦  (I  -  D)  )  ~  (I  -  D)  R 
=  LD-jR  -  ALD1  [  ADX  +  (I  -  D)  ]  “D-jR 

Taking  limit  as  A  -*■  0  we  have 

lim  A( AA  +  B)  ~B  =  LD. R 
A-K)  L 

This  concludes  the  proof  of  the  'if'  part. 

Assume  now  that  (3.1)  holds  and  consider  the  representation 

A  +  B  =  (B  +  C)  +  (A  -  C) 

where  B  +  C  and  A  -  C  are  disjoint  matrices.  Since  A  and 
B  are  p.s.,  B  +  C  has  the  same  row  and  column  spans  as  that  of 
B  .  Consider  rank  factorizations  of  B  +  C  and  A  -  C  : 

B  +  C  = 

A  -  C  =  L2R2 

leading  to  the  rank  factorization  A  +  B  =  LR  where 

/R1 

L  =  (L^  :  L2)  and  R  =  I 

\R2 

If  B  =  L^FR^  ,  clearly  F  is  nonsingular.  Hence  I  -  D  =  diag(F,0) 
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is  of  Drazin  index  1.  The  last  part  of  Theorem  3.3  is  trivial. 
This  concludes  proof  of  Theorem  3.3. 


Theorem  3.4.  If  S(A|B)  is  defined,  then  S[A|S(A|B)]  is 
also  defined  and 

S[A]S(A|B)1  =  S (A  j  B)  (3.10) 

Proof:  The  proof  is  fairly  straightforward  and  is  omitted. 

Theorem  3.5.  A  general  solution  to  a  g-inverse  of  S(A|B)  is 
A-  +  Xb  where  A~  is  an  arbitrary  g-inverse  of  A  and 
is  any  arbitrary  solution  of  the  homogeneous  equation 

BXfcB  =0  (3.11) 

Proof:  Every  g-inverse  of  A  is  a  g-inverse  of  S(A|B)  .  This 
we  have  already  noted  while  proving  theorem  3.2.  Consider  now  a 
matrix  A~  +  X,  as  determined  above 

D 

S(A|B)(A"  +  Xb)  S(a|B)  =  S (A  j  B)  +  S(A(B)XbS(A|B)  =  S(A|B) 
on  account  of  Theorem  3.1(b).  This  shows 

(A_  +  Xb}  c  {[S(A|B)]"} 

Now  choose  and  fix  a  g-inverse  G  of  A  .  A  general  solution 
to  a  g-inverse  of  S(A)B)  is  G  +  X  where  X  is  a  general 

solution  to  the  homogeneous  equation 

S(AjB)X  S(AjB)  =  0  . 

5 

Theorem  3.1(e)  implies  that  any  such  matrix  X  can  be  written 

5 


12 


as  Xs  -  Xa  +  Xb  where  xa  an<*  Xb  satisfy  respectively  the 
equations 

AX  A  =  0  ,  BX,  B  =  0  . 
a  b 

The  matrix  G  +  X  e  {A~}  .  Hence 

a 

{ [S (A | B) ”}  c  (A_  +  Xb) 

Theorem  3.5  is  thus  proved. 

Theorem  3.6.  (a)  If  S(AjB)  and  S(B|A)  are  both  defined, 

then  S (A|B)  and  S(B|A)  are  p.s.  and 

P [S (A | B) ,  S(B|A) ]  *  P(A,B)  (3.12) 

(b)  {[S(AlB) ]"  +  [S(B|A) ]'}  =  (A_  +  B")  (3.13) 

Proof:  (a).  From  theorem  3.3  it  is  seen  that  S(B|A)  will  exist 

2 

iff  Rank  D  =  Rank  D  .  Let  (I  -  D) ^  denote  a  matrix  which  is 
identical  with  I  -  D  everywhere  except  for  the  diagonal  block 
corresponding  to  the  zero  eigen  value  of  D  ,  which  is  replaced 
by  a  null  matrix.  As  in  the  proof  of  Theorem  3.3,  it  is  seen  that 

S  (B  j  A)  =  lim  A(XB  +  A)  _B  =  L(I  -  D),R 
X-*-0 

When  both  S{A|B)  and  S(B)A)  exist  it  is  seen  that  D1  and 
(I  -  D) ^  which  are  both  block  diagonal,  have  nonnull  diagonal 
blocks  at  identical  positions,  each  such  nonnull  pair  adding  up 
to  an  identity  matrix  of  same  order  as  of  the  diagonal  block 
concerned.  This  shows  that  S(A|B)  and  S ( B | A)  are  parallel 
summable  and 
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P[S(A|B) ,S(B|A) ]  =  LDX(I  -  D)lR  =  LD (I  -  D)R 

=  P (A, B) 


(b) .  Clearly  {A-  +  B~}  c  {[S(A|B)]~  +  [S(B|A)]"}  . 

Conversely  by  theorem  3.5,  [S(A|B)]~  can  be  written  as  G  +  X. 

cl  ] 

where  Gg  €  {A-}  and  BXbB  =  0  .  Similarly  [S (B | A) ] ”  can  be 
written  as  Gfa  +  X&  where  Gb  e  {B~}  and  AXflA  =  0  .  Hence 


[S (A | B) ]  +  [S ( B | A) ]  =  Ga  +Xb  +Gfa  +  Xfl 


-  (Ga  +  Xa)  .  (Gb  +  X,,)  . 


This  shows  {  [ S  (A |  B)  ]  +  [ S  (B  |  A)  ]  }  <=  {A  +  B  }  which  concludes 
the  proof  of  the  part  (b)  and  of  theorem  3.6. 


Theorem  3.7. 

S [P (A, B)  j C]  *  P[S(A|c) ,B3  =  PCS(B)C) ,A]  (3.14) 

when  the  parallel  sum  and  shorted  matrices  involved  are  defined. 
Proof.  Using  Theorem  3.5  and  Theorem  2.2(e) 

{ (S [P (A,B) |C] )"}  -  {A"  +  B“  +  Xc> 

{ (P[S(A|C) ,B] )“}  =  {A_  +  Xc  +  B~} 

{ (P[S(B|C) ,A] )"}  =  {B-  +  Xc  +  A“} 

This  shows  the  three  matrices  in  (3.14)  have  identical  general 
solutions  for  a  g-inverse.  Since  a  matrix  is  uniquely  determined 
by  its  class  of  g-inverses  (Theorem  2.4.2,  [19])  (3.14)  is  estab¬ 
lished. 

The  following  theorem  can  also  be  proved  using  a  similar 


argument.  We  omit  the  proof 
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Theorem  3.8. 

S[S(A|B)|CJ  -  S[A|S(B|C)]  =  S[A|S(C|B)) 
=  S  (A  j  P  (B  ,C)  ] 


(3.15) 


when  the  parallel  sum  and  shorted  matrices  involved  are  defined. 


4 .  Another  Approach 

In  view  of  Theorem  3.1(f),  one  is  tempted  to  put  forward  the 
following  definition  of  the  shorted  matrix  imitating  the  Anderson- 
Trapp  definition  given  in  Section  2. 

Let  A  be  a  given  matrix  in  Cmxn  and  S,T  be  given  sub¬ 
spaces  in  Em,En  respectively.  The  shorted  matrix  S(A| S,T)  is 
a  matrix  C  in  Cm*n  ,  such  that 

(a)  M(C)  c  S  ,  M(C*)  c  f  (4.1) 

(B)  if  E  e  Cmxn  ,  M(E)  c  S  ,  and  M(E*)  c  T  , 

(4.2) 

then  Rank  (A  -  E)  >  Rank  (A  -  C) 


This  definition  however  does  not  always  lead  to  a  unique  answer. 
Consider  for  example  the  matrix 
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It  is  seen  that  for  arbitrary  scalars  a  and  b  the  matrix 

10  0 
a  b  0 

0  0  0 

satisfies  the  conditions  required  of  the  matrix  C  in  the  above 
definition.  The  following  theorem  gives  necessary  and  sufficient 
conditions  on  the  triplet  (A,S,T)  so  that  S(A[ S,T)  may  exist 
uniquely. 

Let  S  and  T  be  column  spans  of  matrices  L^  and  R^* 
respectively, and  let  the  columns  of  L 2  and  R2*  span  respectively 
complementary  subspaces  of  S  in  Em  and  of  T  in  En  .  We 
assume  that  ,L2 ,R^* , R2*  are  of  full  column  ranks.  Let  us  write 


A 


<W 


11 

W12 

21 

W22 

(4.3) 


Theorem  4.1.  The  shorted  matrix  S(A|S,T)  exists  and  is  unique 
iff 


M(W21)  c  M(W22)  ,  tf(W12)  c  M(W*2)  (4.4) 

When  (4.4)  is  satisfied 

S(A|S,T)  =  -  «12W22W21)R1 

Proof:  The  required  conditions  are  seen  to  be  independent  of 
the  specific  choice  of  matrices  L^,L2,R^  and  R2  in  the  sense 
that  if  the  conditions  are  met  for  one  choice,  they  would  also 
be  met  for  an  alternative  choice. 
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Assume  now  that  (4.4)  holds  and  write 


A  =  L1W11R1  +  L1W12R2  +  L2W21R1  +  L2W22R2 

-  Ll(wn  -  w12w-2w21)Rl  +  (Liw12  ♦  l2w22mr2  ♦  w22w2lRl) 

L1(W11  "  W12W22W21)R1  +  (L1W12W22  +  L2* (W21R1  +  W22R2J 


=  Ai  +  A2 


say 


(4.5) 


Clearly  M(A^)  c  S  ,  M(A*)  c  T  .  We  shall  show  that  M(A2)  is 

* 

virtually  disjoint  with  S  and  M(A2)  with  T  .  Let 


A2~  =  *LlW12  +  L2W22}  ^R2  +  W22W21R1)~ 

=  (L1W12  +  L2W22)y 

be  a  vector  in  M(A2)  n  M ( L ^ )  .  Then  L22W22y  *  0 

=*>  W22Y  ~  0  *  W12^  =  0  °n  aCcount  of  (4.4) 
a2x  =  0  , 

That  M (A*)  is  virtually  disjoint  with  T  is  similarly  estab¬ 
lished.  That  the  matrix  A^  satisfies  also  the  condition  (4.2) 
required  of  the  C  matrix  and  is  the  unique  matrix  to  do  so  fol¬ 
lows  as  in  the  proof  of  theorem  3.1(f). 

Conversely  suppose  that  there  exists  a  unique  matrix  C  satis¬ 
fying  (4.1)  and  (4.2).  This  implies  that  M (A  -  C)  is  virtually 
disjoint  with  S  and  M  (A  -  C  )  with  T  .  Let  C  *  » 

A  -  C  *  L21R21  be  rank  factorizations  of  C  and  A  -  C  respec¬ 
tively.  Let  the  columns  of  L^  *  (L^sL^)  provide  a  basis  for 
5  and  those  of  R*  =  (r^:R*2)  a  basis  for  T  .  Further  let  the 

columns  of  (L^:L2^:L22)  form  a  basis  of  Em  and  those  of 

*  *  *  ,  n 
(R1:R21sR22  ^  a  bas*s  for  E 


Let  us  write  L2  for  the  matrix 
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(L2i:Ii22)  and  R2  for  the  matrix  (R21:R22)  .  Clearly  by 
construction  the  matrices  C  and  A  -  C  are  of  the  form 
C  =  I'j.Wj.jRj.  ,  A  -  C  =  L2W22R2  wh*ch  shows  that  in  a  repre¬ 
sentation  of  the  type  (4.3)  ,  W^2  =  0  ,  W21  *  0  and  the  condi¬ 
tion  (4.4)  is  trivially  satisfied. 

In  [17],  Mitra  and  Trapp  defined  the  generalized  shorted 
operator  as  the  strong  hybrid  sum  of  A  with  the  null  matrix. 
Theorem  4.1  is  closely  related  with  this  definition.  The 
reader  is  also  referred  to  Theorem  2  in  Carlson  [5]  which  con¬ 
siders  decompositions  of  the  matrix  A  with  a  somewhat  differ¬ 
ent  emphasis. 

When  A  is  a  square  matrix,  that  is  m  =  n  and  S  *  T  , 
condition  (4.4)  is  seen  to  be  equivalent  to  the  condition  that 
A  be  S1  complementable  (Ando  [4])  where  denotes  the 

orthogonal  complement  of  S  .  Further 

S(A|S,S)  =  A/Sl 

the  generalized  Schur  complement.  The  verification  is  fairly 
straightforward.  For  the  case  m  /  n  ,  the  notion  could  be 
extended  as  follows: 

Let  M  ,  N  be  given  subspaces  in  Em,En  respectively  and 
following  Ando's  notation  let  1^  and  1^  denote  respectively 
the  orthogonal  projectors  onto  M  and  W  under  the  respective 
dot  products. 

Definition.  The  matrix  A  is  said  to  be  M,N  complementable 
if  there  exists  matrices  e  Cn'*n'  ,  e  Cnxn  such  that 
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When  (4.6)  and 


VM  *  M£  '  Vr 

IMANr  =  V  '  M*AI 
(4.7)  are  satisfied, 


N 


N  ~  AIN  * 


we  have 


(4.6) 

(4.7) 


MgA  =  MjlANr  =  ANr  (4.8) 

M  AN  which  clearly  depends  only  on  M,N  and  A  is  called 

A/  *• 

the  Schur  compression  of  A  and  denoted  by  A^  ^  .  A  -  M£ANr 
is  called  the  generalized  Schur  complement  of  A  and  denoted 

by  A/M,N  ' 


Theorem  4.2.  Let  A  be  M,N  complementable.  Then 

A/M  N1  =  S  (Af  M1  /N1)  (4.9) 

Proof :  Observe  that 

V*  -  W  -  1M(A  -  *V  *  0 

(A  -  M^ANr)  Ijy  .  (A  -  MtA)IN  =  0  . 

These  show  that 

M(A/M,N*  c  M  '  Ml-  c  N 

Further  if 

M)lANra  -  ANra  e  M(AM>W)  n  M^ 

AN  a  =  I  ..AN  a  *  0  .  This  shows  M(A,.  ..)  is  virtually  disjoint 
r  Mr  m  t  n 

with  M  .  Similarly  WC (AM  N)  *  ]  is  seen  to  be  virtually  dis- 


joint  with  .  Hence  A 


w  is  seen  to  be  the  unique  matrix 
satisfying  (4.1)  and  (4.2)  with  S  ■  AT*-  ,  T  *  W*-  . 

For  an  application  of  Theorem  4.1  consider  the  following 
problem.  Let  A,B  £  Cm*n  and  A  denote  the  matrix 

“ ) 

A  +  B  / 

Put 

Hoi  *h:i 


By  Theorem  4.1,  S(AjS,T)  exists  uniquely  iff 
M (A)  c  M (A  +  B)  ,  M (A* )  c  M (A*  +  B*)  ,  that  is,  if  A  and  B 

are  p.s.  Further,  if  this  condition  is  satisfied 


S(AIS,T) 


A  (A  +  B)”A 

°) 

m  j 

r  p  (a  ,  b) 

°) 

(4.10) 

0 

°l 

1 

i  0 

°l 

Anderson  and  Trapp  [3]  have  used  (4.10)  to  define  the  parallel 
sum  of  hermitian  n.n.d.  matrices  through  the  concept  of  a  shorted 
operator. 


i 
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5 .  Computation  of  the  Shorted  Matrix 

Let  A  c  Cmxn  ,  X  e  CmXp  ,  Y  £  CqXn  .  Let  F  denote  the 

(A  X  \  x 

1  where  0  is  the  null  matrix  in  Cq  p  .  The 
Y  0/ 

following  result  can  easily  be  established.  We  omit  the  proof. 

Lemma  5.1.  Condition  (4.4)  is  equivalent  to  each  of  the  follow¬ 
ing  conditions. 

(a)  Rank  F  =  Rank  (A  X)  +  Rank  Y  =  Rank  +  Rank  X  (5.1) 


(b)  M  g  c  M (F)  ,  M  J*)  c  M(F*) 


(5.2) 


Let  G  = 


fci  c2 

^C3  “C4 


(F"} 


Lemma  5.2.  If  condition  (5.1)  or  equivalent  (5.2)  holds,  then 


(i) 

XC3X  =  X 

,  YC2Y  =  Y 

(ii) 

YC^X  =  0 

,  A ,C  X  =  0  ,  YCjA  =  0 

(iii) 

ac2y  =  XC 

,A  =  XC.Y 

3  4 

(iv) 

ACjACjA  = 

AC^A  ,  Tr  AC^  =  Rank  (ArX)  -  Rank  X 

(v) 


=  Rank 


=  Rank  Y 


j^j  £  {(A:X)"}  ,  (C^C,,)  £  {  J 


} 


Proof:  Lemma  5.2  except  for  the  second  part  of  (iv)  follows  from 
the  following  equations 


FGF  =  F 


(5.3a) 


FG 


(5.3b) 


! 

i 
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(A: 0) GF  =  (ArO)  (5.3c) 

the  last  two  equations  being  consequences  of  (5.2). 

Since  G  €  {F-}  ,  C 3  c  {X-}  ,  C 2  e  {Y-}  , 

Rank  F  =  Rank  FG  =  Tr  FG  =  Tr  (ACX  +  Xc3)  +  Tr  YC2 

=  Tr  AC^  +  Rank  X  +  Rank  Y  . 

The  second  part  of  (iv)  therefore  follows  from  (5.1) 

Theorem  5.1.  Let  S  =  M (X)  ,  T  =  M(Y*)  .  If  condition  (5.1)  or 

equivalently  (5.2)  holds,  the  matrix 

XC .Y  =  AC.Y  =  XC-A  (5.4) 

4  2  3 

is  the  unique  shorted  matrix  (AjS,L)  . 

Proof:  (5.3b)  =«> 

AC^A  +  XC3A  =  A  (5.5) 

Since  A  -  XC3A  =  AC^A  ,  it  suffices  to  show  that 

M(ACxA)  n  M(X)  =  (0}  (5.6a) 

MUACjA)*]  n  M(Y*)  =  (0}  (5.6b) 

If  ACjAa  =  Xb  e  M(AC1A)  n  M(X)  ,  then 

AC-jAa  =  AC^AC-^Aa  =  AC^Xb  =  0  =*>  (5.6a). 

(5.6b)  is  similarly  established.  The  rest  of  the  proof  of  Theorem 
5.1  is  similar  to  the  proof  of  the  'if'  part  of  Theorem  4.1. 
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Remark  1.  Lemma  5.2  for  the  special  case  where  A  is  a  real 
symmetric  n.n.d.  matrix  and  Y  =  was  found  by  Rao  [18]  . 

That  C2  in  such  a  case  is  a  minimum  A  seminorm  g-inverse  of 
Y  was  shown  in  [19,  Corollary  1,  p.  47]  .  Using  Theorem  2.1  of 
Mitra  and  Puri  [14]  it  is  seen  that  AC2Y  =  S (A)  where 
S  =  M (X)  .  It  is  remarkable  that  in  the  general  case,  the  same 

formula  also  provides  the  shorted  matrix  S(A|S,T)  when  in  no 
conceivable  way,  can  C2  be  interpreted  a  minimum  A  seminorm 
g-inverse  of  Y  . 

Remark  2.  Formula  (5.4)  appears  to  be  direct  and  therefore  simplier 
to  compute  compared  to  the  expression  given  in  Theorem  4.1,  as  it 
does  not  require  the  identification  of  complementary  subspaces  of 
S  and  T  and  determination  of  the  W..  matrices  in  (4.3). 


Theorem  5.2.  Let  the  matrices  A  and  B  in  cmXn  be  p.s.  and 
in  addition  let  (3.8)  or  equivalently  (3.9)  hold,  then  the  matrix 


F 


satisfies  condition  (5.1)  and  if  G 


€  {F'}  , 


then  AC2B  =  S(a|b)  . 
Proof :  Rank  F  =  Rank 


A+B 

B 


=  Rank 


(A+B 
0 


B  -  ) 
-B  (A+B)  B  J 


since  (B*)  c  (a*  +  B*)  . 

The  RHS  is  further  equal  to 


Rank  (A+B)  +  Rank  [B(A  +  B)  B]  =  Rank  (A: B)  +  Rank  B 


=  Rank 


+  Rank  B  • 


The  second  part  of  Theorem  5.2  follows  from  Theorem  5.1. 
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Some  Applications  of  the  Shorted  Matrix 

Recovery  of  interblock  information  in  incomplete 

block 

experiments  [6,9] 

Consider  a  pair  of 

consistent  linear  equations 

Ax  =  a 

(6.1) 

Bx  =  b 

(6.2) 

and  the  combined  equation 

(A  +  B)  x  =  a  +  b  (6.3) 

We  shall  assume  that  M(A)  <=  M(A  +  b)  so  that  the  equations 
(6.3)  may  be  consistent  whenever  (6.1)  and  (6.2)  are  so.  This 
condition  is  satisfied  for  example  when  A  and  B  are  p.s. 
matrices.  The  linear  function  p*x  assumes  a  unique  value  for 
every  solution  x  of  (6.1)  iff 

p  e  M (A* )  (6.4) 

Among  such  linear  functions  we  are  interested  in  identifying 
those  for  which  substitution  of  a  solution  of  (6.3)  or  of  (6.1) 
leads  to identical  answers.  Such  problems  crop  up  in  the  theory 
of  recovery  of  interblock  information  in  incomplete  block  exper¬ 
iments  where  (6.1)  and  (6.2)  are  respectively  the  normal  equa¬ 
tions  for  deriving  intra-  and  inter  block  estimates  and  (6.3) 
is  the  normal  equation  for  deriving  combined  intra-  inter  block 
estimates.  When  S(A|B)  exists. 


a  neat  answer  is  provided  by 
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Theorem  6.1. 

Theorem  6.1.  If  S(A|B)  =  C  exists,  then 

P  (A  +  B)  (a  +  b)  =  p*A-a  Va  e  M (A)  ,  b  e  M (B)  (6.5) 

iff  P  e  M(A*  -  C*)  (6.6) 

Proof:  'if  part'  .  We  note  first  that  M (A*  -  C*)  c  M(A*)  . 

Let  xq  satisfy  (6.1) 

(A  -  C)  (A  +  B)  "(a  +  b) 

=  (A  -  C) (B  +  C  +  A  -  C)“[ (A  +  B)xq  +  b  -  BXq] 

=  (A  -  C) (B  +  C  +  A  -  C ) ~ ( A  +  B)xq 

since  (B  +  C)  and  (A  -  C)  are  disjoint  matrices  and 

b  -  Bxq  e  A((B)  =  ,Vi(B  +  C)  .  The  RHS  further  simplifies  to 

(A  -  C)  (A  +  B)  ”  (A  +  B)  X  =  (A  -  C)  x 

o  o 

since  M  (A  -  C  )  c  M  (A  )  c  M  (A*  +  B*;  . 

'only  if  part'.  (6.5)  ^ 

p* (A  +  B) ~b  =  0  V  be  M(B) 

=>  p*  (A  +  B)“B  =  0  (6.7) 

(6.4)  =?>  p  =  X*A  for  some  X  e  Em  (6.8) 

Substituting  (6.8)  in  (6.7)  we  have 

X*P (A,B)  =  0  (6.9) 

Since  M[P(A,B)]  =  M[S(A|B)],  and  A  e  {C~}  ,  it  follows  that 


25. 


(6.9)  =>  A*C  =  0 

=>  \*  =  y*[  I  -  CA-]  for  some  y  e  Em  .  (6.10) 

Hence  p*  =  A*A  =  y*[I  -  CA~]A  =  y* (A  -  C)  (6.6) 

This  concludes  the  proof  of  Theorem  6.1. 

(b)  Test  of  linear  hypothesis  in  linear  models 

2 

Let  the  random  vector  Y  ~  Nn(Xg,a  I)  where  g  an  m-tuple 

2 

is  an  unknown  parameter  vector  and  a  >  0  is  also  an  unknown 
parameter.  X  is  a  known  matrix. 

Consider  a  hypothesis 

Hq  :  Hg  =  b  (6.11) 

We  shall  assume  that  the  equation  (6.11)  is  consistent.  As  other¬ 
wise  the  hypothesis  could  be  rejected  without  any  formal  statisti¬ 
cal  test.  It  was  shown  in  [15]  that  when  Hg  is  not  estimable, 
only  the  estimable  part  of  (6.11)  could  be  tested.  To  be  more 
precise,  let  K  be  a  matrix  such  that 

M  (H  1  K '  )  =  M(H  '  )  n  M(X  ' ) 

where  '  on  a  matrix  indicates  its  transpose.  Then  one  could  only 
test  if 


KHg  =  Kh  (6.12) 

and  deviations  from  (6.11)  that  do  not  result  in  deviations  from 
(6.12)  will  go  undetected.  In  the  same  paper  the  authors  sug¬ 
gested  an  expression  for  K  .  One  could  alternatively  use 
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K  =  X ' X (H 1 H  +  X'X)  H' 

in  view  of  Theorem  2.2(f). 

We  shall  however  recommend 


K  =  CH 


m(C) 


where  C  =  X'X  is  the  matrix  of  normal  equations  that  provide 
least  squares  estimates  for  the  parameters  6  and  is 

a  minimum  C  seminorm  g-inverse  of  H. 

Observe  that 


KH  =  S(C) 

is  the  shorted  matrix  C  where  S  =  M(H')  [12].  The  shorted 

matrix  is  symmetric  and 

M (KH)  =  M(H'K')  =  M(H')  n  M(C)  =  M(H')  n  M(X') 

A 

Further  the  dispersion  matrix  D(KHB)  of  the  BLUE  of  KHB  is 
given  by 

D (KHg)  =  [S(C)]C~[S(C)]o2 
=  S(C)o2. 

2 

Note  that  C  /o  is  a  g-inverse  of  this  dispersion  matrix.  If 
u  =  S(C)B  -  9  where  g  =  CH~^cjh  ,  we  have  the  following  simple 
formula  for  computing  the  expression  which  appears  in  the  numer¬ 
ator  of  the  variance  ratio  test.  We  note  in  this  connection  that 
under  HQ  , 


27. 


u'C_u/o2  ~  x2 
~  V 

2 

where  is  a  chi  square  random  variable  with  the  degrees 

of  freedom  v  =  Rank  S(C)  =  trace  C  (C)  .  Since  any  routine 
least  square  analysis  of  the  data  would  provide  C  ,  C~  and 
C  C  ,  the  suggested  use  of  the  shorted  matrix  is  more  attrac¬ 
tive  compared  to  alternatives  proposed  earlier.  It  requires 
lesser  number  of  additional  matrix  inversions,  and  as  a  by¬ 
product  gives  Hm(c)  can  be  used  to  test  the  consistency 

of  the  equation  (6.11). 
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